Abstract

For a mean zero random variable $Y$ with positive, finite variance $\sigma^2$, the variable $Y^*$ is said to have the $Y$-zero biased distribution if

$$E[Yf(Y)] = \sigma^2 E[f'(Y^*)]$$

for all absolutely continuous functions $f$ for which these expectations exist. If $Y$ and such a $Y^*$ can be constructed on a common space such that $|Y^* - Y| \le C$ for some $C > 0$, and the moment generating function $m(s) = E[e^{sY}]$ of $Y$ exists on $[0, 1/C)$, then

$$P(Y \ge t) \le \exp\left(-\frac{t^2}{2(\sigma^2 + Ct)}\right) \quad \text{for all } t \ge 0,$$

with the same bound holding for the left tail when $m(s)$ exists on $(-1/C, 0]$. An improvement of these right and left tail bounds for large $|t|$ holds when $m(s)$ exists on $[0, 2/C)$ and $(-2/C, 0]$, respectively. Applications considered include the sum of population characteristics in a simple random sample, and the Hoeffding statistic when the random permutation is uniform over the symmetric group, and when its distribution is constant on conjugacy classes.

1 Introduction 

The concentration of measure phenomenon for a random variable $Y$, that is, upper bounds for probabilities of the form $P(|Y - E(Y)| \ge t\sqrt{\mathrm{Var}(Y)})$ that typically decay exponentially in $t$, has attracted the attention of many researchers; see the extensive treatments of [21], [16] and [3], for example. Starting with the work of [1], connections between the concentration phenomenon and the couplings used in Stein's method, initiated in [19] and [20] (see [5] for an overview), have also attracted interest. In particular, in [1] the Stein exchangeable pair coupling is applied to derive tail bounds for Hoeffding's combinatorial CLT and the net magnetization in the Curie-Weiss model. The size bias coupling, another popular technique from Stein's method, was put to use in [7], [8] and [1] to produce concentration bounds for the number of relatively ordered subsequences of a random permutation, sliding window statistics, the number of local maxima of a random function on a graph, degrees of random graphs, multinomial occupation models and coverage problems in stochastic geometry.

Here we focus on the zero bias coupling, also borrowed from Stein's method and first introduced in [10], and show how it may be used to yield concentration bounds for the sum $Y$ of population characteristics in a simple random sample, and for the Hoeffding statistic [13] of the form

$$Y = \sum_{i=1}^n a_{i\pi(i)}$$

depending on an array $(a_{ij})_{1 \le i,j \le n}$ of real numbers and a random permutation $\pi$. The Hoeffding statistic arises in many applications, and in permutation tests in particular; see [22] and [9], for instance. We provide concentration bounds for $Y$ when $\pi$ is uniformly distributed over the symmetric group, and when its distribution depends only on conjugacy class.

The concentration bound of Theorem 1.1 holds for any random variable for which a bounded zero bias coupling exists. We recall from [10] that for any mean zero random variable $Y$ with positive, finite variance $\sigma^2$, there exists a distribution for a random variable $Y^*$ satisfying

$$E[Yf(Y)] = \sigma^2 E[f'(Y^*)] \tag{1}$$

for all absolutely continuous functions $f$ for which the expectation of either side exists. The variable $Y^*$ is said to have the $Y$-zero biased distribution. By Stein's lemma [19] it is known that a random variable $Y$ is normal if and only if $Y =_d Y^*$. For the use of zero bias couplings to produce bounds in normal approximations see [5] and the references therein.

Theorem 1.1 Let $Y$ be a mean zero random variable with variance $\sigma^2 \in (0, \infty)$ and moment generating function $m(s) = E[e^{sY}]$, and let $Y^*$ have the $Y$-zero bias distribution and be defined on the same space as $Y$.

If $Y^* - Y \le C$ for some $C > 0$ and $m(s)$ exists for all $s \in [0, 1/C)$, then for all $t \ge 0$

$$P(Y \ge t) \le \exp\left(-\frac{t^2}{2(\sigma^2 + Ct)}\right), \tag{2}$$

and the same upper bound holds for $P(Y \le -t)$ if $Y - Y^* \le C$ when $m(s)$ exists for all $s \in (-1/C, 0]$. If $|Y^* - Y| \le C$ and $m(s)$ exists on $(-1/C, 1/C)$ then $P(|Y| \ge t)$ is bounded by twice the right hand side of (2).

If $|Y^* - Y| \le C$ for some $C > 0$ and $m(s)$ exists for all $s \in [0, 2/C)$ then for all $t \ge 0$

$$P(Y \ge t) \le \exp\left(-\frac{t^2}{2(2\sigma^2 + Ct/2)}\right), \tag{3}$$

with the same upper bound holding for $P(Y \le -t)$ if $m(s)$ exists in $(-2/C, 0]$, and if in $(-2/C, 2/C)$ then $P(|Y| \ge t)$ is bounded by twice the right hand side of (3).



We remark that if $|Y^* - Y| \le C$ and $m(s)$ exists in $[0, 2/C)$, then, for the right tail say, the bound (2) is preferred over (3) for $|t| < 2\sigma^2/C$. In typical applications the variance of $Y$ is of order $n$, in which case the first set of bounds is preferred over a set of $t$ values that increases to $\mathbb{R}$ asymptotically.
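As a small illustration of this remark, the following Python sketch evaluates the two right tail bounds of Theorem 1.1 and the crossover point $2\sigma^2/C$. The function names and the numerical values of $\sigma^2$ and $C$ are illustrative assumptions, not taken from the paper.

```python
from math import exp


def zero_bias_tail_bound(t, sigma2, C):
    """Bound (2): exp(-t^2 / (2 (sigma^2 + C t))) for t >= 0."""
    return exp(-t * t / (2.0 * (sigma2 + C * t)))


def zero_bias_tail_bound_large_t(t, sigma2, C):
    """Bound (3): exp(-t^2 / (2 (2 sigma^2 + C t / 2))) for t >= 0."""
    return exp(-t * t / (2.0 * (2.0 * sigma2 + C * t / 2.0)))


def crossover(sigma2, C):
    """Value of t where (2) and (3) coincide; (2) is the smaller bound below it."""
    return 2.0 * sigma2 / C


# Illustrative values only.
sigma2, C = 25.0, 1.0
t_star = crossover(sigma2, C)
for t in (10.0, t_star, 200.0):
    print(t, zero_bias_tail_bound(t, sigma2, C), zero_bias_tail_bound_large_t(t, sigma2, C))
```

Below the crossover the first bound is the smaller one, and above it the second is.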

Comparisons between the bounds obtained by the well known Azuma-Hoeffding inequality, the use of self bounding functions [17], and the size bias method of [7] and [8] are discussed in [1], where it is shown that the latter two methods may yield improvements of the former. In brief, when $Y$ has mean $\mu$ and is a function of $n$ independent random variables the Azuma-Hoeffding inequality yields a bound for the right tail of $Y$ of the form

$$P(Y - \mu \ge t) \le \exp\left(-\frac{t^2}{2\sum_{i=1}^n c_i^2}\right), \tag{4}$$

where $c_i$ is a bound on how much $Y$ can change upon varying the $i$th independent variable. In typical applications the order of the sum $\sum_{i=1}^n c_i^2$ may grow as $O(n)$. In contrast, the method of self bounding functions and the size bias method yield bounds that are expressed in terms of the mean $\mu$, and in the latter case in particular yield the right tail bound

$$P(Y - \mu \ge t) \le \exp\left(-\frac{t^2}{2C(\mu + t)}\right) \quad \text{for all } t \ge 0, \tag{5}$$

and the left tail bound

$$P(Y - \mu \le -t) \le \exp\left(-\frac{t^2}{2C\mu}\right) \quad \text{for all } t \ge 0, \tag{6}$$

where $C$ is a constant that does not depend on $n$.

There are regimes where the mean $\mu$ grows at a slower rate than $n$, and where, consequently, (5) and (6) produce bounds that serve a needed purpose, but where (4) would not be adequate; see [1]. To take one simple example from [1], when $Y$ has the Binomial $B(n,p)$ distribution, the bounds (5) and (6) hold with $C = 1$, and hence, in the case $p \in (0, 1/2]$, taking the minimum of the bounds for $Y$ and $n - Y$, we obtain the right tail bound

$$P(Y - np \ge t) \le \begin{cases} \exp\left(-\dfrac{t^2}{2(np + t)}\right) & \text{for } 0 \le t \le n(1 - 2p), \\[2ex] \exp\left(-\dfrac{t^2}{2n(1 - p)}\right) & \text{for } t > n(1 - 2p), \end{cases} \tag{7}$$

which is a strict improvement over the application of (4) in this case, where $c_i$ must be taken to be one, and hence $\sum_{i=1}^n c_i^2 = n$.

In contrast to the bounds (5) and (6), which depend on the mean $\mu$ of $Y$, the bound of Theorem 1.1 depends on the variance $\sigma^2$, and may lead to further improvements. Again taking the Binomial case to illustrate, setting $W = Y - \mu$, a result we recall in Section 3 shows that when $W$ is the sum of independent non-trivial centered Bernoulli variables, a $W^*$ having the $W$-zero bias distribution can be constructed by replacing one centered summand $X_i - p$ by an independent variable having that summand's zero bias distribution. As (10) gives that the replaced variable must obey the same boundedness condition as the original one, we obtain $|W^* - W| \le 1$, and (2) and (3) respectively yield, with $\sigma^2 = np(1 - p)$, the right tail bounds

$$P(Y - np \ge t) \le \exp\left(-\frac{t^2}{2(\sigma^2 + t)}\right), \quad P(Y - np \ge t) \le \exp\left(-\frac{t^2}{2(2\sigma^2 + t/2)}\right). \tag{8}$$

The first inequality in (8) improves on the size bias bound (7) for $0 < t \le n(1 - 2p)$, and the second for $n(1 - 2p) < t < 2n(1 - p)(1 - 2p)$ for $p \in (0, 1/2]$, with additional improvements obtained for more restricted ranges of $p$. In particular these bounds also improve on the Azuma-Hoeffding bound (4) for this case.
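The Binomial comparison can be made concrete with a short Python sketch. It evaluates the Azuma-Hoeffding bound (4) with $c_i = 1$, the piecewise size bias bound (7), and the smaller of the two zero bias bounds in (8). The function names and the choice $n = 1000$, $p = 0.1$ are assumptions made only for illustration.

```python
from math import exp


def azuma_hoeffding(t, n):
    # Bound (4) with c_i = 1 for every i, so that sum c_i^2 = n.
    return exp(-t * t / (2.0 * n))


def size_bias(t, n, p):
    # Piecewise bound (7), stated for p in (0, 1/2].
    if t <= n * (1 - 2 * p):
        return exp(-t * t / (2.0 * (n * p + t)))
    return exp(-t * t / (2.0 * n * (1 - p)))


def zero_bias(t, n, p):
    # Smaller of the two bounds in (8), with sigma^2 = n p (1 - p) and C = 1.
    s2 = n * p * (1 - p)
    first = exp(-t * t / (2.0 * (s2 + t)))
    second = exp(-t * t / (2.0 * (2.0 * s2 + t / 2.0)))
    return min(first, second)


n, p = 1000, 0.1  # illustrative values only
for t in (50.0, 400.0, 850.0):
    print(t, azuma_hoeffding(t, n), size_bias(t, n, p), zero_bias(t, n, p))
```

For these values the zero bias bound is the smallest of the three at each $t$, in line with the ranges stated above.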

Finally, we note that the conclusions of Theorem 1.1 remain true in the case $C = 0$, that is, when $Y$ is a mean zero normal random variable. In this instance inequality (2) yields

$$P(Y \ge t) \le \exp\left(-\frac{t^2}{2\sigma^2}\right) \quad \text{for } t \ge 0,$$

which is well known to be optimal by the standard Mills ratio inequalities.

2 Proof of the main result 

We state two properties of the zero bias distribution that will be applied below. First, from (1), it is easy to see that whenever $a \ne 0$, we have

$$(aY)^* =_d aY^*. \tag{9}$$

Next, from [10], if $Y$ is bounded by some constant, then $Y^*$ is also bounded by the same constant, that is,

$$|Y| \le c \quad \text{implies} \quad |Y^*| \le c. \tag{10}$$

We now present the proof of Theorem 1.1.

Proof: If the moment generating function $m(s)$ of $Y$ exists in an open interval containing $s$ we may interchange expectation and differentiation at $s$ and obtain

$$m'(s) = E[Ye^{sY}] = \sigma^2 E[se^{sY^*}] = \sigma^2 s\, m^*(s), \tag{11}$$

where we have set $m^*(s) = E[e^{sY^*}]$, and applied the zero bias relation (1) to yield the second equality.

We first prove (2). Starting with the well known inequality $1 - x \le e^{-x}$, holding for all $x \ge 0$, we obtain

$$e^x \le \frac{1}{1 - x} \quad \text{for } x \in [0, 1).$$

Hence, for $\theta \in (0, 1/C)$ and $0 \le s \le \theta$ we have

$$m^*(s) = E[e^{sY^*}] = E[e^{s(Y^* - Y)} e^{sY}] \le e^{sC} m(s) \le \frac{1}{1 - sC}\, m(s).$$

Using the identity (11) to express $m^*(s)$ in terms of $m'(s)$ we obtain

$$m'(s) \le \frac{\sigma^2 s}{1 - sC}\, m(s).$$

Dividing both sides by $m(s)$, integrating over $[0, \theta]$, using that $1/(1 - sC) \le 1/(1 - \theta C)$ for $0 \le s \le \theta$ and that $m(0) = 1$, we obtain

$$\log m(\theta) = \int_0^\theta \frac{m'(s)}{m(s)}\, ds \le \frac{\sigma^2}{1 - \theta C} \int_0^\theta s\, ds = \frac{\sigma^2 \theta^2}{2(1 - \theta C)},$$

and exponentiation yields

$$m(\theta) \le \exp\left(\frac{\sigma^2 \theta^2}{2(1 - \theta C)}\right).$$

As (2) holds trivially for $t = 0$ consider $t > 0$ and apply Markov's inequality to obtain

$$P(Y \ge t) = P(e^{\theta Y} \ge e^{\theta t}) \le e^{-\theta t} m(\theta) \le \exp\left(-\theta t + \frac{\sigma^2 \theta^2}{2(1 - \theta C)}\right).$$

Setting $\theta = t/(\sigma^2 + Ct)$, and noting this value lies in the interval $(0, 1/C)$, (2) follows.

If now $Y - Y^* \le C$ then letting $X = -Y$ we see from (9) with $a = -1$ that $-Y^*$ has the $X$-zero bias distribution, and applying (2) for the coupling of $X$ to $X^* = -Y^*$ now yields the claimed left tail inequality, and hence the bound for $P(|Y| \ge t)$.

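Turning to (3), by the convexity of the exponential function we have

$$\frac{e^y - e^x}{y - x} = \int_0^1 e^{ty + (1 - t)x}\, dt \le \int_0^1 \left(te^y + (1 - t)e^x\right) dt = \frac{e^y + e^x}{2} \quad \text{for all } x \ne y,$$

and hence

$$e^y - e^x \le \frac{|y - x|(e^y + e^x)}{2}.$$

In particular, as $|Y^* - Y| \le C$, for all $\theta \in (0, 2/C)$ and $0 \le s \le \theta$,

$$e^{sY^*} - e^{sY} \le \frac{|s(Y^* - Y)|(e^{sY^*} + e^{sY})}{2} \le \frac{Cs}{2}\left(e^{sY^*} + e^{sY}\right). \tag{12}$$

Taking expectation in (12) yields

$$m^*(s) - m(s) \le \frac{Cs}{2}\left(m^*(s) + m(s)\right),$$

and rearranging yields

$$m^*(s) \le \left(\frac{1 + Cs/2}{1 - Cs/2}\right) m(s). \tag{13}$$

Applying relation (11) in (13) we have

$$m'(s) \le \sigma^2 s \left(\frac{1 + Cs/2}{1 - Cs/2}\right) m(s),$$

and dividing both sides by $m(s)$ we obtain, since $1 + Cs/2 \le 2$ and $1 - Cs/2 \ge 1 - C\theta/2$ for $0 \le s \le \theta < 2/C$,

$$\frac{m'(s)}{m(s)} \le \sigma^2 s \left(\frac{1 + Cs/2}{1 - Cs/2}\right) \le \frac{2\sigma^2 s}{1 - C\theta/2}.$$

Since $m(0) = 1$, integration on $[0, \theta]$ yields

$$\log m(\theta) = \int_0^\theta \frac{m'(s)}{m(s)}\, ds \le \frac{2\sigma^2}{1 - C\theta/2} \int_0^\theta s\, ds = \frac{\sigma^2 \theta^2}{1 - C\theta/2},$$

and exponentiating we obtain

$$m(\theta) \le \exp\left(\frac{\sigma^2 \theta^2}{1 - C\theta/2}\right) \quad \text{for all } \theta \in (0, 2/C). \tag{14}$$

Noting that the result holds trivially for $t = 0$, fix $t > 0$ and $\theta \in (0, 2/C)$. Arguing as before, (14) and Markov's inequality yield

$$P(Y \ge t) = P(e^{\theta Y} \ge e^{\theta t}) \le e^{-\theta t} m(\theta) \le \exp\left(-\theta t + \frac{\sigma^2 \theta^2}{1 - C\theta/2}\right).$$

Letting $\theta = t/(2\sigma^2 + Ct/2)$, and noting that this value lies in $(0, 2/C)$, we obtain the asserted right tail inequality (3). We now repeat the argument above to obtain the two remaining claims.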
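The two substitutions for $\theta$ made in the proof can be checked numerically. The Python sketch below evaluates the exponents obtained from Markov's inequality at $\theta = t/(\sigma^2 + Ct)$ and $\theta = t/(2\sigma^2 + Ct/2)$ and compares them with the closed forms in (2) and (3). The function names and the values of $t$, $\sigma^2$ and $C$ are arbitrary illustrative assumptions.

```python
def exponent_first(theta, t, sigma2, C):
    # Exponent -theta t + sigma^2 theta^2 / (2 (1 - theta C)) from the proof of (2).
    return -theta * t + sigma2 * theta ** 2 / (2.0 * (1.0 - theta * C))


def exponent_second(theta, t, sigma2, C):
    # Exponent -theta t + sigma^2 theta^2 / (1 - C theta / 2) from the proof of (3).
    return -theta * t + sigma2 * theta ** 2 / (1.0 - C * theta / 2.0)


t, sigma2, C = 3.0, 2.0, 0.5  # illustrative values only
theta1 = t / (sigma2 + C * t)
theta2 = t / (2.0 * sigma2 + C * t / 2.0)

closed_form_2 = -t * t / (2.0 * (sigma2 + C * t))
closed_form_3 = -t * t / (2.0 * (2.0 * sigma2 + C * t / 2.0))

print(exponent_first(theta1, t, sigma2, C), closed_form_2)
print(exponent_second(theta2, t, sigma2, C), closed_form_3)
```

Each printed pair should agree up to floating point error, and both values of $\theta$ lie in the intervals required by the proof.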

3 Construction of zero bias couplings 

In this section we will review constructions of zero bias couplings in different settings. Details 
can be found in [3]. 

3.1 Zero biasing independent sums 

For illustration we consider zero biased coupling constructions in the simplest case, where Y 
is a sum of independent, mean zero random variables each having a finite, positive variance. 
As formalized in the following proposition, the zero biased distribution of such a sum can 
be constructed by choosing a summand with probability proportional to its variance, and 
replacing it with a variable from its own zero bias distribution. 

Proposition 3.1 Let $X_1, \ldots, X_n$ be independent, mean zero random variables with finite, positive variances $\sigma_1^2, \ldots, \sigma_n^2$ and set

$$Y = \sum_{i=1}^n X_i.$$

Then with $\sigma^2 = \sigma_1^2 + \cdots + \sigma_n^2$ and $I$ a random index independent of $X_1, \ldots, X_n$ having distribution

$$P(I = i) = \frac{\sigma_i^2}{\sigma^2}, \tag{15}$$

and $X_i^*$ having the $X_i$-zero bias distribution, independent of $X_j, j \ne i$ and of $I$, the variable

$$Y^* = Y - X_I + X_I^*$$

has the $Y$-zero biased distribution.
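The construction of Proposition 3.1 can be checked exactly for small sums of centered Bernoulli variables. The Python sketch below relies on one fact not stated in the text but verifiable directly from (1): the zero bias distribution of a centered Bernoulli variable $X_i - p_i$ is uniform on $[-p_i, 1 - p_i]$, so that $E[f'(s + X_i^*)] = f(s + 1 - p_i) - f(s - p_i)$. The function name and the test function $f(y) = y^3$ are illustrative choices.

```python
from itertools import product


def zero_bias_identity_sides(ps, f):
    """Return (E[Y f(Y)], sigma^2 E[f'(Y*)]) for Y a sum of centered Bernoulli(p_i).

    Y* is built as in Proposition 3.1: summand I is chosen with probability
    sigma_I^2 / sigma^2 and replaced by a Uniform[-p_I, 1 - p_I] variable.
    """
    variances = [p * (1 - p) for p in ps]
    sigma2 = sum(variances)
    lhs = 0.0
    rhs = 0.0
    for bits in product((0, 1), repeat=len(ps)):
        prob = 1.0
        for b, p in zip(bits, ps):
            prob *= p if b else 1 - p
        xs = [b - p for b, p in zip(bits, ps)]
        y = sum(xs)
        lhs += prob * y * f(y)
        for i, p in enumerate(ps):
            rest = y - xs[i]  # sum of the summands other than the i-th
            # Average of f' over an interval of length 1 is the increment of f.
            mean_f_prime = f(rest + 1 - p) - f(rest - p)
            rhs += prob * (variances[i] / sigma2) * mean_f_prime
    return lhs, sigma2 * rhs


lhs, rhs = zero_bias_identity_sides([0.2, 0.5, 0.7], lambda y: y ** 3)
print(lhs, rhs)
```

The two printed numbers should coincide up to rounding, which is relation (1) for this $Y$ and $f$.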



3.2 Use of the exchangeable pair 



The next construction, due to [10], is based on the existence of $\lambda$-Stein pairs, that is, exchangeable variables $(Y', Y'')$ that satisfy the linearity condition

$$E[Y'' | Y'] = (1 - \lambda)Y' \quad \text{for some } \lambda \in (0, 1).$$

Proposition 3.2 Let $(Y', Y'')$ be a $\lambda$-Stein pair with $\mathrm{Var}(Y') = \sigma^2 \in (0, \infty)$ and distribution $F(y', y'')$. Then

$$E[Y'] = 0 \quad \text{and} \quad E(Y' - Y'')^2 = 2\lambda\sigma^2,$$

and when $(Y^\dagger, Y^\ddagger)$ has distribution

$$dF^\dagger(y', y'') = \frac{(y' - y'')^2}{2\lambda\sigma^2}\, dF(y', y''), \tag{16}$$

and $U \sim \mathcal{U}[0, 1]$ is independent of $(Y^\dagger, Y^\ddagger)$, the variable

$$Y^* = UY^\dagger + (1 - U)Y^\ddagger \tag{17}$$

has the $Y'$-zero biased distribution.
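Proposition 3.2 can be verified exactly on a toy example. The Python sketch below takes $Y'$ to be a sum of $n$ independent $\pm 1$ signs and forms $Y''$ by replacing a uniformly chosen sign with an independent copy, which gives a $\lambda$-Stein pair with $\lambda = 1/n$ and $\sigma^2 = n$. This example, the function name and the test function $f(y) = y^3$ are assumptions chosen for illustration and do not come from the paper.

```python
from itertools import product


def stein_pair_sides(n, f):
    """Return (E[Y' f(Y')], sigma^2 E[f'(Y*)]) with Y* built from (16) and (17)."""
    lam, sigma2 = 1.0 / n, float(n)
    lhs = 0.0
    rhs = 0.0
    for x in product((-1, 1), repeat=n):
        p_x = 0.5 ** n
        y1 = sum(x)
        lhs += p_x * y1 * f(y1)
        for i in range(n):
            for new in (-1, 1):
                y2 = y1 - x[i] + new
                if y2 == y1:
                    continue  # pairs with y' = y'' get zero weight under (16)
                joint = p_x * (1.0 / n) * 0.5
                weight = joint * (y1 - y2) ** 2 / (2.0 * lam * sigma2)
                # For U uniform on [0, 1], E f'(U a + (1 - U) b) = (f(a) - f(b)) / (a - b).
                rhs += weight * (f(y1) - f(y2)) / (y1 - y2)
    return lhs, sigma2 * rhs


lhs, rhs = stein_pair_sides(4, lambda y: y ** 3)
print(lhs, rhs)
```

For $n = 4$ and $f(y) = y^3$ both sides equal $E[Y'^4] = 40$.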



In the typical uses of the coupling construction suggested by Proposition 3.2, including those in Sections 4.2, 4.3.1 and 4.3.2 below, given $Y'$, we first construct $Y''$ close to $Y'$, such that $(Y', Y'')$ is a $\lambda$-Stein pair, and use it to form the difference $Y' - Y''$ that appears in (16). Then, perhaps independently, one constructs the parts of $Y^\dagger, Y^\ddagger$ that depend on the "square biased" term $(Y' - Y'')^2$. Finally, one constructs the remaining parts of $Y^\dagger, Y^\ddagger$ by using, as much as possible, the corresponding parts of $Y', Y''$ to achieve variables $Y^\dagger, Y^\ddagger$ that are close and have joint distribution $dF^\dagger(y', y'')$.

In many applications the construction just described results in $S$, a function of the variables which can be kept fixed, and variables $T'$, $T^\dagger$ and $T^\ddagger$, all on a joint space, such that

$$Y' = S + T', \quad Y^\dagger = S + T^\dagger, \quad \text{and} \quad Y^\ddagger = S + T^\ddagger. \tag{18}$$

When $T'$, $T^\dagger$ and $T^\ddagger$ are all bounded by $B$, (17) gives

$$|Y^* - Y'| = |UT^\dagger + (1 - U)T^\ddagger - T'| \le U|T^\dagger| + (1 - U)|T^\ddagger| + |T'| \le 2B. \tag{19}$$

4 Applications 

In all our applications that follow we consider random variables $Y$ that are bounded, and hence make no further mention of the conditions on the moment generating function in Theorem 1.1.



4.1 Sum of independent random variables 

Applying the construction of Proposition 3.1 to a sum of bounded independent random variables we obtain a version of the classical Bernstein inequality.

Theorem 4.1 Let $X_1, \ldots, X_n$ be independent mean zero random variables with variances $\sigma_i^2 = \mathrm{Var}(X_i) \in (0, \infty)$, and that satisfy $|X_i| \le c$ for all $i = 1, \ldots, n$. Then all the bounds of Theorem 1.1 hold for $Y = \sum_{i=1}^n X_i$ with $\sigma^2 = \sum_{i=1}^n \sigma_i^2$, and $C$ replaced by $2c$.

Proof: By Proposition 3.1 we can form $Y^*$ by replacing a single summand, selected independently with probability (15) proportional to its variance, by a random variable independent of the remaining summands and having that summand's zero bias distribution. Thus

$$|Y^* - Y| = |X_I^* - X_I| \le 2c$$

using (10), and Theorem 1.1 now obtains.
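As a sanity check of Theorem 4.1, the following Python sketch compares the bound (2) with $C = 2c$ against the exact right tail of a sum of $n$ independent $\pm 1$ signs, for which $c = 1$ and $\sigma^2 = n$. The example and the function names are illustrative assumptions.

```python
from math import comb, exp


def bernstein_type_bound(t, sigma2, c):
    # Bound (2) of Theorem 1.1 with C replaced by 2c, as in Theorem 4.1.
    C = 2.0 * c
    return exp(-t * t / (2.0 * (sigma2 + C * t)))


def rademacher_exact_tail(n, t):
    # P(S >= t) for S a sum of n independent +/-1 signs; S = 2k - n with k ~ Binomial(n, 1/2).
    favourable = sum(comb(n, k) for k in range(n + 1) if 2 * k - n >= t)
    return favourable / 2 ** n


n = 20  # illustrative value only
for t in range(0, n + 1, 4):
    print(t, rademacher_exact_tail(n, t), bernstein_type_bound(t, float(n), 1.0))
```

In every row the exact tail probability should not exceed the bound.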

4.2 Simple random sampling 

Let $\mathcal{A}$ be a collection of real valued characteristics of a population of size $N$, not all equal, and let $Y$ be the sum of the characteristics of a simple random sample of size $n$. With $\bar{a}$ the average of the elements of $\mathcal{A}$, a straightforward calculation yields that the mean $\mu$ and variance $\sigma^2$ of $Y$ are given by

$$\mu = n\bar{a} \quad \text{and} \quad \sigma^2 = \frac{n(N - n)}{N(N - 1)} \sum_{a \in \mathcal{A}} (a - \bar{a})^2. \tag{20}$$

Theorem 4.2 Let $\{X_1, \ldots, X_n\}$ be a random sample of size $n$ from a set $\mathcal{A}$ of $N$ real numbers, not all equal, with $n$ and $N$ satisfying $2 < n < N - 1$. Then with $Y = \sum_{i=1}^n X_i$, the bounds of Theorem 1.1 hold with $Y$ replaced by $Y - \mu$, the mean $\mu$ and variance $\sigma^2$ given in (20), and

$$C = 6 \max_{a \in \mathcal{A}} |a - \bar{a}|.$$

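Proof: Center by replacing $Y$ and $a \in \mathcal{A}$ by $Y - \mu$ and $a - \bar{a}$, respectively. When $2 < n < N - 1$, if $\{X', X'', X_2, \ldots, X_n\}$ is a simple random sample of size $n + 1$ from $\mathcal{A}$, it is shown in [12] that $(Y', Y'')$ given by

$$Y' = X' + \sum_{i=2}^n X_i \quad \text{and} \quad Y'' = X'' + \sum_{i=2}^n X_i$$

form a $\lambda$-Stein pair with $\lambda = N/(n(N - n)) \in (0, 1)$. In addition, independently sampling $X^\dagger, X^\ddagger$ according to the distribution

$$q(a, b) \propto (a - b)^2\, \mathbf{1}(\{a, b\} \subset \mathcal{A}),$$

one can find a zero bias coupling $Y^*$ to $Y'$ satisfying the decomposition (18) with

$$S = \sum_{a \in \mathcal{S}} a, \quad T' = X' + \sum_{a \in \mathcal{R}'} a, \quad T^\dagger = X^\dagger + \sum_{a \in \mathcal{R}^\dagger} a \quad \text{and} \quad T^\ddagger = X^\ddagger + \sum_{a \in \mathcal{R}^\dagger} a,$$

where

$$\mathcal{S} = \{X_2, \ldots, X_n\} \setminus \{X^\dagger, X^\ddagger\} \quad \text{and} \quad \mathcal{R}' = \{X_2, \ldots, X_n\} \cap \{X^\dagger, X^\ddagger\},$$

with $\mathcal{R}^\dagger$ obtained by taking a simple random sample of size $|\mathcal{R}'|$ from the complement of

$$\mathcal{Q} = \{X_2, \ldots, X_n\} \cup \{X^\dagger, X^\ddagger\}.$$

As $\mathcal{R}'$ and $\mathcal{R}^\dagger$ are of size at most 2, the variables $T'$, $T^\dagger$ and $T^\ddagger$ are all bounded by $B = 3 \max_{a \in \mathcal{A}} |a|$. Hence by (19) there exists a coupling of $Y'$ and $Y^*$ with $|Y^* - Y'| \le 2B$. Theorem 1.1 now obtains.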
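For a small population the quantities in Theorem 4.2 can be computed and compared with exact enumeration. The Python sketch below evaluates $\mu$ and $\sigma^2$ from (20) and the constant $C = 6\max_a |a - \bar{a}|$, then lists the exact right tail of $Y - \mu$ next to the bound (2). The population, the sample size and the function names are hypothetical choices made for illustration.

```python
from itertools import combinations
from math import exp


def srs_parameters(population, n):
    """Mean and variance from (20), and the constant C of Theorem 4.2."""
    N = len(population)
    a_bar = sum(population) / N
    sum_sq = sum((a - a_bar) ** 2 for a in population)
    mu = n * a_bar
    sigma2 = n * (N - n) / (N * (N - 1)) * sum_sq
    C = 6 * max(abs(a - a_bar) for a in population)
    return mu, sigma2, C


def exact_right_tail(population, n, t):
    """P(Y - mu >= t), enumerating all samples of size n drawn without replacement."""
    mu = n * sum(population) / len(population)
    sums = [sum(s) for s in combinations(population, n)]
    return sum(1 for y in sums if y - mu >= t) / len(sums)


population = [1, 2, 3, 4, 5, 6, 7, 8]  # hypothetical population, N = 8
n = 3                                  # satisfies 2 < n < N - 1
mu, sigma2, C = srs_parameters(population, n)
for t in (0.0, 2.5, 5.0, 7.5):
    bound = exp(-t * t / (2 * (sigma2 + C * t)))
    print(t, exact_right_tail(population, n, t), bound)
```

With $C$ this large relative to $\sigma^2$ the bound is loose for such a small population; the sketch is meant only to show how the pieces fit together.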

4.3 Hoeffding's permutation statistics 

Let $\pi$ be a random permutation in the symmetric group $S_n$ and $A = (a_{ij})_{1 \le i,j \le n}$ an $n \times n$ matrix with real entries. Hoeffding's statistic, of the form

$$Y = \sum_{i=1}^n a_{i\pi(i)}, \tag{21}$$

arises in a number of contexts, permutation testing problems foremost among them; see [22] for a seminal reference, and [9] and references within. Hoeffding's combinatorial central limit theorem [13] gives conditions under which $Y$, properly centered and scaled, has an asymptotic normal distribution. The rate of convergence of $Y$ to its normal limit is well studied, see for instance [5] and references therein. Here we apply Theorem 1.1 to obtain concentration inequalities for $Y$.

4.3.1 Uniform Distribution on Permutations 

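First we consider the case where $\pi$ in (21) is chosen uniformly over $S_n$. Letting

$$a_{i\cdot} = \frac{1}{n}\sum_{j=1}^n a_{ij}, \quad a_{\cdot j} = \frac{1}{n}\sum_{i=1}^n a_{ij} \quad \text{and} \quad a_{\cdot\cdot} = \frac{1}{n^2}\sum_{i,j=1}^n a_{ij},$$

straightforward calculations show that the mean $\mu_A$ and variance $\sigma_A^2$ of $Y$ are given by

$$\mu_A = na_{\cdot\cdot} \tag{22}$$

and

$$\sigma_A^2 = \frac{1}{n - 1}\sum_{i,j}\left(a_{ij}^2 - a_{i\cdot}^2 - a_{\cdot j}^2 + a_{\cdot\cdot}^2\right) = \frac{1}{n - 1}\sum_{i,j}\left(a_{ij} - a_{i\cdot} - a_{\cdot j} + a_{\cdot\cdot}\right)^2. \tag{23}$$

Theorem 4.3 Let $n > 3$ and $A = (a_{ij})_{1 \le i,j \le n}$ be an array of real numbers such that, with $\mu_A$ and $\sigma_A^2$ given by (22) and (23) respectively, the variance $\sigma_A^2$ is non-zero. Let $\pi$ be a random permutation uniformly distributed over $S_n$ and $Y$ be as in (21). Then the bounds of Theorem 1.1 hold with $Y$ replaced by $Y - \mu_A$, $\sigma^2$ by $\sigma_A^2$ and

$$C = 8 \max_{1 \le i,j \le n} |a_{ij} - a_{i\cdot}|. \tag{24}$$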



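Proof: Relabel $Y$ and $\pi$ as $Y'$ and $\pi'$, respectively, and center by replacing $Y'$ by $Y' - \mu_A$ and $a_{ij}$ by $a_{ij} - a_{i\cdot}$. With $\tau_{ij}$ the transposition that interchanges $i$ and $j$, that is, $\tau_{ij}(i) = j$, $\tau_{ij}(j) = i$ and $\tau_{ij}(k) = k$ for all $k \notin \{i, j\}$, it is shown in [11] that by sampling $I, J$ uniformly from $\{1, \ldots, n\}$ over all distinct pairs, independently of $\pi'$, the variables $Y'$ and

$$Y'' = \sum_{i=1}^n a_{i\pi''(i)}$$

form a $\lambda$-Stein pair with $\lambda = 2/(n - 1)$ when $\pi'' = \pi'\tau_{IJ}$.

Following [11], see also [5], and the outline given in Section 3.2 to construct $Y^\dagger$ and $Y^\ddagger$, sample $I^\dagger, J^\dagger, K^\dagger, L^\dagger$, independently of the remaining variables, from the distribution

$$p(i, j, k, l) = \frac{[(a_{ik} + a_{jl}) - (a_{il} + a_{jk})]^2}{4n^2(n - 1)\sigma_A^2}$$

and set

$$\pi^\dagger = \begin{cases} \pi'\tau_{\pi'^{-1}(K^\dagger), J^\dagger} & \text{if } L^\dagger = \pi'(I^\dagger),\ K^\dagger \ne \pi'(J^\dagger), \\ \pi'\tau_{\pi'^{-1}(L^\dagger), I^\dagger} & \text{if } L^\dagger \ne \pi'(I^\dagger),\ K^\dagger = \pi'(J^\dagger), \\ \pi'\tau_{\pi'^{-1}(K^\dagger), I^\dagger}\tau_{\pi'^{-1}(L^\dagger), J^\dagger} & \text{otherwise,} \end{cases}$$

and $\pi^\ddagger = \pi^\dagger\tau_{I^\dagger, J^\dagger}$. Then with $\mathcal{I} = \{I^\dagger, \pi'^{-1}(K^\dagger), J^\dagger, \pi'^{-1}(L^\dagger)\}$ the decomposition (18) holds with

$$S = \sum_{i \notin \mathcal{I}} a_{i,\pi'(i)}, \quad T' = \sum_{i \in \mathcal{I}} a_{i,\pi'(i)}, \quad T^\dagger = \sum_{i \in \mathcal{I}} a_{i,\pi^\dagger(i)} \quad \text{and} \quad T^\ddagger = \sum_{i \in \mathcal{I}} a_{i,\pi^\ddagger(i)}.$$

Since $\mathcal{I}$ is a set of size at most 4, by (19) we find that there exists a coupling of the centered variable $Y'$ to a random variable having its zero bias distribution with the distance between the two bounded by $C$ of (24). Theorem 1.1 now obtains.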
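For a small array the quantities in Theorem 4.3 can be checked by enumerating all permutations. The Python sketch below computes $\mu_A$, $\sigma_A^2$ and $C$ from (22), (23) and (24) and compares the two-sided tail of $Y - \mu_A$ with twice the bound (2). The $4 \times 4$ array and the function names are hypothetical choices for illustration.

```python
from itertools import permutations
from math import exp


def hoeffding_parameters(a):
    """mu_A from (22), sigma_A^2 from the second expression in (23), and C from (24)."""
    n = len(a)
    row = [sum(a[i]) / n for i in range(n)]
    col = [sum(a[i][j] for i in range(n)) / n for j in range(n)]
    total = sum(row) / n
    mu = n * total
    var = sum((a[i][j] - row[i] - col[j] + total) ** 2
              for i in range(n) for j in range(n)) / (n - 1)
    C = 8 * max(abs(a[i][j] - row[i]) for i in range(n) for j in range(n))
    return mu, var, C


def hoeffding_values(a):
    """Values of Y = sum_i a[i][pi(i)] over all permutations pi of {0, ..., n-1}."""
    n = len(a)
    return [sum(a[i][p[i]] for i in range(n)) for p in permutations(range(n))]


# Hypothetical array, n = 4.
a = [[1.0, 2.0, 3.0, 4.0],
     [2.0, 0.0, 1.0, 5.0],
     [3.0, 1.0, 4.0, 1.0],
     [0.0, 2.0, 2.0, 6.0]]
mu, var, C = hoeffding_parameters(a)
values = hoeffding_values(a)
for t in (0.0, 2.0, 4.0):
    tail = sum(1 for y in values if abs(y - mu) >= t) / len(values)
    print(t, tail, 2 * exp(-t * t / (2 * (var + C * t))))
```

The enumeration also offers a direct check of the mean and variance formulas, since the empirical mean and variance of `values` should match `mu` and `var`.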

Remark 4.1 Letting $\|a\| = \max_{1 \le i,j \le n} |a_{ij} - a_{i\cdot}|$, note that for $n > 3$ the variance $\sigma_A^2$ in (23) can be bounded as

$$\sigma_A^2 \le \frac{4n^2}{n - 1}\|a\|^2 \le 6n\|a\|^2.$$

Thus the bound derived from (2), for instance, of Theorem 1.1 can be upper bounded in terms of $\|a\|$ as

$$P(|Y - \mu_A| \ge t) \le 2\exp\left(-\frac{t^2}{12n\|a\|^2 + 16\|a\|t}\right).$$
Remark 4.2 In [1] the concentration bound

$$P(|Y - \mu_A| \ge t) \le 2\exp\left(-\frac{t^2}{4\mu_A + 2t}\right) \tag{25}$$

of the Hoeffding statistic $Y$ is shown under the additional condition that $0 \le a_{ij} \le 1$ for all $i, j$. Under this condition one can take $C = 8$ in (24), and Theorem 4.3 yields

$$P(|Y - \mu_A| \ge t) \le 2\exp\left(-\frac{t^2}{2\sigma_A^2 + 16t}\right). \tag{26}$$

In particular, the bound (26) will be smaller than (25) whenever $t \le (2\mu_A - \sigma_A^2)/7$.

Moreover, when $a_{ij}, 1 \le i,j \le n$ are themselves independent random variables with law $\mathcal{L}(U)$ having support in $[0, 1]$, then

$$E\sigma_A^2 = (n - 1)\mathrm{Var}(U) \le (n - 1)E(U^2) < nE(U) = E\mu_A,$$

where the first equality follows by a calculation using the first expression for the variance in (23), and applying $0 \le U \le 1$ to yield $EU^2 \le EU$ for use in the strict inequality. Hence if the array entries behave as independent random variables on $[0,1]$, (26) will be asymptotically preferred to (25) everywhere.
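The comparison between (25) and (26) reduces to comparing the denominators $4\mu_A + 2t$ and $2\sigma_A^2 + 16t$. The Python sketch below evaluates both bounds and the threshold $(2\mu_A - \sigma_A^2)/7$. The function names and the values of $\mu_A$ and $\sigma_A^2$ are illustrative assumptions.

```python
from math import exp


def bound_25(t, mu_A):
    # Bound (25): 2 exp(-t^2 / (4 mu_A + 2 t)).
    return 2.0 * exp(-t * t / (4.0 * mu_A + 2.0 * t))


def bound_26(t, sigma2_A):
    # Bound (26): 2 exp(-t^2 / (2 sigma_A^2 + 16 t)), taking C = 8.
    return 2.0 * exp(-t * t / (2.0 * sigma2_A + 16.0 * t))


def threshold(mu_A, sigma2_A):
    # (26) is no larger than (25) for t up to this value.
    return (2.0 * mu_A - sigma2_A) / 7.0


mu_A, sigma2_A = 50.0, 10.0  # illustrative values only
print(threshold(mu_A, sigma2_A))
for t in (10.0, 20.0):
    print(t, bound_25(t, mu_A), bound_26(t, sigma2_A))
```

With these values the threshold is $90/7$, so (26) is the smaller bound at $t = 10$ and the larger one at $t = 20$.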

4.3.2 Permutation distribution constant on conjugacy classes 

We now consider Hoeffding's statistic

$$Y = \sum_{i=1}^n a_{i\pi(i)}$$

when the distribution of $\pi$ is constant over cycle type. This framework includes two special cases of note, one where $\pi$ is a uniformly chosen fixed point free involution, considered by Goldstein and Rinott [9] and Ghosh [6], having applications to permutation testing in certain matched pair experiments, and the other where $\pi$ has the uniform distribution over permutations with a single cycle, considered by Kolchin and Chistyakov [15], under the additional restriction that $a_{ij} = b_i c_j$. Bounds on the error of the normal approximation to $Y$ when the distribution of $\pi$ is constant over cycle type were derived in [11].

We start by recalling some relevant definitions. For $q = 1, \ldots, n$ letting $c_q(\pi)$ be the number of $q$ cycles of $\pi$, the vector

$$c(\pi) = (c_1(\pi), \ldots, c_n(\pi))$$

is the cycle type of $\pi$. For instance, the permutation $\pi = ((1, 3, 7, 5), (2, 6, 4))$ in $S_7$ consists of one 4 cycle in which $1 \to 3 \to 7 \to 5 \to 1$, and one 3 cycle where $2 \to 6 \to 4 \to 2$, and hence has cycle type $(0, 0, 1, 1, 0, 0, 0)$. We say the permutations $\pi$ and $\sigma$ are of the same cycle type if $c(\pi) = c(\sigma)$, and that a distribution $P$ on $S_n$ is constant on cycle type if $P(\pi)$ depends only on $c(\pi)$, that is

$$P(\pi) = P(\sigma) \quad \text{whenever } c(\pi) = c(\sigma).$$

Clearly a vector $c = (c_1, \ldots, c_n)$ of nonnegative integers is a cycle type of some permutation if and only if $\sum_{i=1}^n ic_i = n$. Given such a vector $c$, a special case of a distribution constant on cycle type is one uniformly distributed over all permutations having cycle type $c$, denoted $\mathcal{U}(c)$. The situations where $\pi$ is uniformly chosen from the set of all fixed point free involutions, and chosen uniformly from all permutations having a single cycle, are both distributions of type $\mathcal{U}(c)$, the first with $c = (0, n/2, 0, \ldots, 0)$ for even $n$ and the second with $c = (0, 0, \ldots, 0, 1)$.
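The cycle type is straightforward to compute. The Python sketch below is one possible implementation, using the convention (an assumption of this sketch) that a permutation of $\{0, \ldots, n - 1\}$ is stored as a list with `perm[i]` the image of `i`; the function name is illustrative.

```python
def cycle_type(perm):
    """Return (c_1, ..., c_n), where c_q is the number of q cycles of perm.

    perm is a list of length n representing the map i -> perm[i] on {0, ..., n-1}.
    """
    n = len(perm)
    counts = [0] * n
    seen = [False] * n
    for start in range(n):
        if seen[start]:
            continue
        # Walk the cycle containing start and record its length.
        length, j = 0, start
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        counts[length - 1] += 1
    return tuple(counts)


# The example 1 -> 3 -> 7 -> 5 -> 1 and 2 -> 6 -> 4 -> 2 from the text, shifted to 0-based labels.
example = [2, 5, 6, 1, 0, 3, 4]
print(cycle_type(example))       # (0, 0, 1, 1, 0, 0, 0)
print(cycle_type([1, 0, 3, 2]))  # fixed point free involution on 4 points: (0, 2, 0, 0)
```

In both cases the constraint $\sum_i ic_i = n$ can be read off directly from the output.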

In general we consider distributions $\mathcal{U}(c)$ for which $c_1 = 0$, that is, where $\pi$ has no fixed points, as is true for the two special cases of most interest. Noting that under this condition no expression of the form $a_{ii}$ appears in the sum (21), let

$$a_{io} = \frac{1}{n - 2}\sum_{j: j \ne i} a_{ij} \quad \text{and} \quad a_{oo} = \frac{1}{(n - 1)(n - 2)}\sum_{i \ne j} a_{ij}.$$

Under the symmetry condition $a_{ij} = a_{ji}$ for all $i \ne j$, from [5], the mean $\mu = E[Y]$ and variance $\sigma_c^2 = \mathrm{Var}(Y)$ for $n > 4$ are given by

$$\mu = (n - 2)a_{oo} \quad \text{and} \quad \sigma_c^2 = \left(\frac{1}{n - 1} + \frac{2c_2}{n(n - 3)}\right)\sum_{i \ne j}(a_{ij} - 2a_{io} + a_{oo})^2. \tag{27}$$

When $n$ is even and $c$ is the cycle type of a fixed point free involution, then $c_2 = n/2$ and the variance in (27) specializes to

$$\sigma_c^2 = \frac{2(n - 2)}{(n - 1)(n - 3)}\sum_{i \ne j}(a_{ij} - 2a_{io} + a_{oo})^2. \tag{28}$$

When $c$ is the cycle type of permutations that have no two cycles, such as is the case for $n > 4$ and permutations having only one long cycle, the variance in (27) becomes

$$\sigma_c^2 = \frac{1}{n - 1}\sum_{i \ne j}(a_{ij} - 2a_{io} + a_{oo})^2. \tag{29}$$

For an array $(a_{ij})_{i \ne j}$ let

$$a_o = \max_{i \ne j} |a_{ij} - 2a_{io} + a_{oo}|. \tag{30}$$

Theorem 4.4 Let $n > 5$ and $(a_{ij})_{i,j=1}^n$ be an array of real numbers satisfying $a_{ij} = a_{ji}$, and for some cycle type $c$ without fixed points let $\pi \in S_n$ have the uniform distribution $\mathcal{U}(c)$. Then the bounds of Theorem 1.1 hold with $Y$ replaced by $Y - \mu$ and $\sigma^2$ replaced by $\sigma_c^2$, where $\mu$ and $\sigma_c^2$ are given by (27), and $C$ replaced by $40a_o$ with $a_o$ of (30).

In the special case when $n$ is even and $\pi$ is uniformly distributed over involutions without fixed points, the same bounds hold with $\sigma^2$ given by $\sigma_c^2$ of (28), and $C = 24a_o$.

Proof: When $\pi$ has the $\mathcal{U}(c)$ distribution, the constructions of [11] and [5], similar to those used in the proof of Theorem 4.3, provide a zero biased coupling for $Y - \mu$ that satisfies

$$|(Y - \mu)^* - (Y - \mu)| \le 40a_o,$$

and further show that the constant 40 can be replaced by 24 in the case of fixed point free involutions. Theorem 1.1 now obtains.